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(54) Hepatocyte growth factor activator inhibitor 

(57) A novel protein having inhibitory activity on the 
protease activity of hepatocyte growth factor (HGF) acti- 
vator was purified and isolated, and its molecular weight 
(ca. 30,000 daltons) and partial amino acid sequence 
were determined. A gene coding for the protein was 
cloned, and the gene DNA was incorporated into a vec- 
tor, for transforming host cells. Cultivation of the trans- 
formant gave the desired protein The protein can be 
used as an in vivo or in vitro control factor for HGF or 
HGF activator It is also useful as an antigen to be used 
m producing an antibody to be used as means for 
kinetic studies of the protein, or as a standard in assay 
systems therefor. 
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Description 

FIELD OF THE INVENTION 

The present invention relates to a novel protein and a DNA coding for the same. More particularly, it relates to a 
novel protein having inhibitory activity on the protease activity of hepatocyle growth factor activating factor (HGF acti- 
vator) (hereinafter this protein in sometimes referred to also as "HAI-II"), a gene coding for the protein, an expression 
vector containing the gene, a transformant as transformed with the expression vector, and a method of producing HAI- 
II using the transformant 

BACKGROUND OF THE INVENTION 

It is already reported that thrombin activates the precursor of HGF activator (JP-A-5-103670, JP-A-6-141859, JP- 
A-6-153946 and JP-A-6-153966 (the term "JP-A" as used herein means an "unexamined published Japanese patent 
application'*)); factor having activity to convert the single chain form of hepatocyte growth factor (HGF) to its double 
chain form) in the manner of positive activity control. However, any human tissues-derived protease inhibitor capable of 
inhibiting, as a negative control factor, the physiological activity of HGF activator has not been known. Therefore, how 
HGF activator is controlled in human tissues remains unknown. Such a negative control factor might also influence indi- 
rectly on the activity of hepatocyte growth factor (HGF) on which HGF activator acts. Thus, for the analysis of the mech- 
anism of action of HGF in vivo , as well, it has been demanded that such a human tissues-derived protease inhibitor be 
isolated and identified. 

By using such a protease inhibitor and an antibody to the protease inhibitor, it would become possible to know the 
in vivo physiological activity of HGF activator, analyze the mechanism of action thereof or analyze the mechanism of 
control of HGF activation, from a standpoint different from those of the prior art. 

Furthermore, for investigating the detailed in vivo function of HAI-II or the effect of HAI-II in hepatic disorder, for 
instance, HAI-II is required in large quantities. At present, however, there is only one method available for preparing 
HAI-II, which method comprises using, as a starting material, the culture supernatant obtained with a human cancer 
cell line such as MKN45 or A549 cells and purifying therefrom HAI-II occurring therein in trace amounts. This method 
is not always the best one from the labor, time and cost viewpoint. It encounters great difficulties in stably isolating the 
minor amount of HAI-II alone. Therefore, it has been desired that an expression system be constructed so that HAI-II 
can be obtained stably and in large quantities. 

SUMMARY OF THE INVENTION 

The present inventors have conducted screening of various cultured cell lines using, as an indicator, the inhibitory 
activity on the protease activity of hepatocyte growth factor activator and have found that a substance having the activity 
occurs in the culture supernatant of certain human cancer cell lines (MKN45 cells, A549 cells and like epithelial tumor 
cell lines). To reveal the nature of its inhibitory activity, they further attempted to purify the substance from the MKN45 
cell culture supernatant using various column chromatography techniques. As a result, they have found a novel protein 
with a molecular weight of about 30,000 daltons as determined by SDS (sodium dedecyl sulfate)-polyacrylamide gel 
electrophoresis(PAGE) and, they also have obtained an ammo-terminal amino acid sequence of this protein by analyz- 
ing the protein on a protein sequencer. Further, they determined partial ammo acid sequences by decomposing the pro- 
tein using proteolytic enzymes, isolating the resultant peptides and subjecting each peptide to the same amino acid 
sequence analysis as mentioned above. Furthermore, they estimated DNA base sequences based on the partial amino 
acid sequences and conducted screening of a cDNA library using oligonucleotide probes prepared based on the 
sequences. As a result, they have succeeded in cloning a gene coding for the protein and have now completed the 
present invention. 

Furthermore, as a result of various investigations to produce the protein stably and in large quantities using the 
recombinant DNA technique and, the present inventors have constructed a novel expression vector coding for the pro- 
tein and have enabled expression of the protein. Thus, by constructing a plasmid for protein expression by inserting a 
DNA fragment coding for part or the whole of the amino acid sequence of the protein into a plasmid vector such as the 
expression vector pME18S for use in animal cells or an expression vector for use in yeasts, Escherichia qq\\ and the 
like, at a site downstream from the promoter thereof and using the thus-obtained recombinant plasmid to transform host 
cells, they have now completed the present invention in another aspect. 

The present invention thus relates to a protein having the following physico-chemical properties: 

(1) a molecular weight of about 30,000 daltons as determined by SDS-polyacrylamide gel electrophoresis; 

(2) inhibitory activity on the protease activity of hepatocyte growth factor activator; and 

(3) one of the amino acid sequences depicted in the sequence listing under SEQ ID NO l through 3 or an ammo 
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acid sequence substantially equivalent thereto proteins respectively having the amino acid sequences depicted m 
the sequence listing under SEQ ID N01 through 3 or ammo acid sequences substantially equivalent thereto and 
having inhibitory activity on the protease activity of hepatocyte growth factor activator; a protein having the amino 
acid sequence depicted in the sequence listing under SEQ ID NO A or on amino acid sequence substantially equiv- 
alent thereto; and a protein having, as its ammo acid sequence, that segment of the amino acid sequence depicted 
m the sequence listing under SEQ ID N0 4 which starts with the 28th ammo acid (alanine) residue and ends with 
the 252nd amino acid (leucine) residue, or an amino acid sequence substantially equivalent thereto, DNAs and 
genes coding for the proteins defined above; expression vectors respectively containing the DNA or genes; 
transformants obtained by transformation of host cells with the expression vectors; as well as a method of produc- 
ing proteins having inhibitory activity on the protease activity of hepatocyte growth factor activator which comprises 
cultivating the transformants. 

The base sequence shown in the sequence listing under SEQ ID NO;4 contains only one strand, with the other 
complementary base sequence being omitted. Starting with this gone and using the recombinant DNA technology, it is 
possible to cause expression of, for example, the protein having the amino acid sequence shown in the sequence listing 
under SEQ ID NO:4. On that occasion, the protein translated from mRNA coding for the protein contains a signal 
sequence. After extracellular excretion, however, the signal sequence has been cleaved off and the protein obtained 
has an amino acid sequence comprising the 28th amino acid (alanine) residue end the subsequent amino acid residues 
of the ammo acid sequence shown in the sequence listing under SEQ ID NO;4. Signal sequences of other proteins may 
also be used as the signal sequence. For signal sequence-free mature protein expression in host cells, a gene having 
that portion of the base sequence shown in the sequence listing under SEQ ID NO:4 which comprises the 82nd nucle- 
otide (guanine) residue and the subsequent nucleotide residues may be used as the gene coding for the relevant pro- 
tein and joined to the ATG codon of a vector. The present invention further includes, within the scope thereof, 
modifications of the proteins or DNAs mentioned above as derived therefrom by deletion, substitution and/or addition 
of one or more amino acid or nucleotide residues within limits not harmful to the inhibitory activity on the protease activ- 
ity of HGF activator, namely those proteins or DNAs that respectively have "substantially equivalent ammo acid 
sequences" or "substantially equivalent base sequences". 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows the results of assaying of the protein of the present application for its inhibitory activity on the protease 
activity of HGF activator. 

Fig. 2 shows the structure of the plasmid pME18S-HAI-ll. 

DETAILED DESCRIPTION OF THE INVENTION 

In the following, the present invention is described in further detail. The novel protein of the present invention which 
has protease inhibitor activity can be obtained by proceeding via such purification steps as mentioned below For exam- 
ple, a human cancer cell line (MKN45 cells or A549 cells deposited with the Japanese Cancer Research Resources 
Bank under the deposite numbers JCRB0254 and JCRB0076, respectively, or like epitherlial tumor cell line) is culti- 
vated in a serum-free medium for several days, the culture supernatant is recovered and. after removal of cells there- 
from and concentration, submitted to a hepann-Sepharose column (available e.g. from Pharmacia). The non-adsorbed 
fraction is submitted to a ConA-Sepharose column (available e g from Pharmacia) and separated into an adsorbed 
fraction and an non-adsorbed fraction. The non-adsorbed fraction is subjected to hydrophobic chromatography using 
Phenyl-5PW (available e.g. from Tosoh Corp.). The thus-obtained fraction containing the desired protein is chromato- 
graphed on a DEAE ion exchange column (available e.g. from Polymer Laboratory), then submitted to a hydroxyapatite 
column (available e.g. from Mitsui Toatsu Chemicals or Seikagaku Corp.). and further to gel filtration column chroma- 
tography (using e.g. Asahi Chemical Industry's GS520) to give the protein in question The purification steps may fur- 
,k o' nr!i<1e ^versed phase r oiumn chromatography and/or other appropriate means as necessary 
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jected to amino acid sequence analysis in the same manner as mentioned above, whereby the amino acid sequence 
of an intermediate portion of the protein can be revealed 

A DNA base sequence is deduced from the amino acid sequence thus determined and, correspondingly, appropri- 
ate oligonucleotides are synthesized and used as probes Human-derived liver, spleen and placenta cDNA libraries 
(available e.g. from Clonetec), among others, can be used as the cDNA library for screening out the gene coding for the 
desired protein. In addition, a cDNA library may be constructed in the conventional manner from & cell line or tissue 
material in which the protein is expressed. 

Escherichia coli is transfected with >. phage containing such cDNA incorporated therein (the method of Maniatis et 
al "Molecular Cloning ") and then cultivated. The plaques formed are subjected to selection by plaque hybridization 
using oligonucleotide probes prepared based on the base sequence deduced from the amino acid sequence of a por- 
tion of the protein in question, whereby a certain number of different >. phage clones having the amino acid sequence 
of the desired protein and containing, m addition, those segment base sequences of the protein that correspond to 
other regions than the probes can be obtained with ease. 

Then, the phage from each positive plaque obtained in the above screening is then allowed to replicate by the 
method of Maniatis et al . and DNA is purified therefrom by the glycerol gradient method and, after appropriate restric- 
tion enzyme cleavage, submitted to cDNA subcloning into a plasmid vector such as pUCl8 or pUC19 or a single chain 
phage vector such as Ml3mpl8 or M13mp19. Thereafter, the base sequence of the desired cDNA fragment can be 
determined by the method of Sanger et al. The base sequences of the clones obtained are analyzed and synthesized 
and, as a result a gene totally corresponding to the whole ammo acid sequence of the desired protein as shown in the 
sequence listing under SEQ ID NO:4 can be derived from a group of cDNAs coding for respective portions of the pro- 
tein It is also possible to obtain a gene containing the whole of the cDNA in question, a gene containing the cDNA with 
deletion of a partial base sequence thereof, a gene containing the cDNA with insertion of some other base sequence, 
a gene containing the cDNA with substitution of some other base sequence for a partial base sequence of the cDNA, 
or the like gene from a variety of cDNA libraries by the PCR method using portions of the cDNA in question as probes. 
Such site-specific mutation, inclusive of base sequence deletion, addition or substitution, can be readily realized by the 
methods described in the literature (e.g. Methods in Enzymol., 2T7, 218 (1993); ibid., 217, 270 (1993)). 

The group of cDNAs obtained in the above manner are joined together so that the order of the base sequences is 
fit to the ammo acid sequence of the protein, to give a DNA fragment covering the whole region of the protein. The DNA 
fragment is inserted into a plasmid, such as pCDL-SRa296, at a site downstream from the promoter thereof and 
matched in phase with the translation initiation codon ATG, to thereby construct a protein expression vector Then, the 
protein can be expressed in a host, for example animal cells, transformed with the plasmid. Thereafter, the protein 
expressed can be recovered by purification by a conventional method. 

Thus, each of the thus-obtained cDNAs is inserted into a plasmid, such as pME18S, at a site downstream from the 
promoter thereof to thereby Construct a plasmid for protein expression. The protein or a protein derived therefrom by 
partial amino acid sequence deletion, insertion or substitution can be expressed in a host, such as animal cells, trans- 
formed with the expression plasmid. More concretely, CHO cells, COS cells, mouse L cells, mouse C127 cells, mouse 
FM3A cells and the like can be used as the animal cells for protein expression. When these animal cells are used as 
the host, the use, as a signal sequence, of that portion of the DNA base sequence shown in the sequence listing under 
SEQ ID NO:4, namely the gene for the protein, which starts with the 1st nucleotide and ends with the 35th nucleotide, 
or the use of an existing signal sequence is expected to be conducive to extracellular secretory production of the protein 
or production thereof on the cell membrane. 

The expression plasmid for use in animal cell hosts is constructed in the following manner. As the promoter, use 
can be made of any of the existing promoters, for example the SRa promoter, SV40 promoter or metallothionein gene 
promoter. A DNA containing the whole gene for the protein, inclusive of the above-mentioned signal-like sequence, a 
DNA containing the gene with a partial base sequence deletion, a DNA containing the gene with insertion of a base 
sequence or a DNA containing the gene with substitution of some other sequence for a partial base sequence thereof 
is inserted into a site downstream from the promoter in the direction of transcription. In constructing the expression 
plasmid for the protein, two or three pieces of the DNA fragment of the gene coding for the protein may be joined 
together and used for insertion downstream from the promoter. It is also possible to join such a promoter as the SV40 
promoter to the 5' upstream side of the DNA fragment of the gene coding for the protein to give a unit insert and insert, 
into a vector, two or three such units joined together in the same direction of transcription. A polyadenylation site is 
added to the downstream side of the gene coding for the protein. For example, the polyadenylation site derived from 
the SV40 DNA, p-globin gene or metallothionein gene can be joined to the downstream side of the gene coding for the 
protein. When a DNA fragment comprising a promoter and the gene coding for the protein as joined together is dupli- 
cated or triplicated, each unit may contain a polyadenylation site on the 3' side of the gene coding for the protein. In 
transforming animal cells, for example CHO cells, with such expression vector, a drug resistance gene can be used for 
the purpose of expression cell selection. As the drug resistance gene, there may be mentioned the DHFR gene which 
provides resistance to methotrexate (J. Mol, Biol , 153, 601 (1982)), the Neo gene which provides resistance to the anti- 
biotic G-418 (J. Mol. Appl Genet., 1, 327 (1982)), the Escherichia coli-derived Ecogpt gene which provides resistance 
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to mycophenolic acid (Proc Natl. Acad Sci USA., 78, 2072 (1981)) and the hph gene which provides resistance to 
the antibiotic hygromycm (Mol Cell Biol , 5, 410 (1985)), among others Each resistance gene contains a promoter, 
such as the above-mentioned SV40-denved promoter, inserted on the 5' upstream aide and a polyadenylation site as 
mentioned above on the 3' downstream side of each resistance gene In inserting such resistance gene into the expres- 
sion vector for the protein, the gene may be inserted at a site downstream from the polyadenylation site of the gene cod- 
ing for the protein, m either direction, the same or opposite. These expression vectors make it unnecessary to perform 
double transformation with another plasmid Containing a selective marker gene for the purpose of transformant isola- 
tion When the expression vector for the protein does not contain such a selective marker gene insert, a vector having 
a marker suited for transformant selection, for example pSV2neo (J Mol Appl Genet., 1, 327(1982)), pMBG (Nature, 
294, 228 (1981)), pSV2gpt (Proc. Natl. Acad. Sci. U.S.A., 78, 2072 (1981)) or pAd • D26 • 1 (J. Mol Biol., 159, 601 
(1982)), may be used, in combination with the expression vector for the gene coding for the protein, for transformation 
to thereby make it easy to perform transformant selection based on phenotypic expression of the drug resistance gene. 

The expression vector can be introduced into animal cells by the calcium phosphate method (Virology, 52, 456 
(1973)) or the electroporation method (J. Membr. Biol., l_Q, 279 (1972)), for instance. The transformed animal cells can 
be cultivated in the conventional manner in the manner of suspension culture or adhesion culture. They are cultivated 
in a medium such as MEM or RPMI 1640 in the presence of 5 to 10% of serum or in the presence of an appropriate 
amount of insulin, transferrin or the like, or under serum-free conditions. Further, it is also possible to produce the pro- 
tein using microorganisms such as yeants or Escherichia coli , for example strains of Saccharomyces cerevisiae or the 
strain Escherichia coli YA-21. since the cells express the protein in the culture supernatant or on the cell surface, it is 
possible to recover and purify the protein using the culture supernatant or cells of this transformant. More specifically, 
the protein can be isolated and purified by subjecting the culture supernatant or cell extract containing the protein pro- 
duced to an appropriate chromatography procedure, for example chromatographic treatment using hepann-Sepharose, 
ConA-Sepharose, hydroxyapatite and the like in combination. 

The protease inhibitor activity-endowed protein of the present invention has inhibitory activity on the protease activ- 
ity of HGF activator and, therefore, is useful as a in vitro or in vivo regulatory factor for HGF activator or, indirectly as a 
HGF activity regulating factor. The protein as well as an antibody to the protein or a gene coding for the protein is further 
useful as a tool or means for function analysis of the factors. 

Furthermore, by introducing an expression vector carrying a gone coding for the protein into animal cells, it 
becomes possible to produce part or the whole of the protein or a protein equivalent thereto, which is biologically active, 
in a stable manner and in large quantities. This has so far been difficult to attain. 

The present invention is now illustrated in greater detail with reference to the following Examples. However, it is not 
intended that the present invention be limited to these Examples. 

EXAMPLE 1 

(Purification of the protein using an MKN45 cell culture supernatant) 

MKN45 cells (Naito et al . Ganto Kagaku-ryoho (Cancer and Chemotherapy), 5, 89 (1978)) (obtained from Meneki 
Seibutsu Kenkyusho) were seeded into eRDF medium containing 5% FBS (fetal bovine serum) as placed m a roller bot- 
tle 850 and allowed to multiplicate until a confluent state was attained Then, the FBS-containmg culture supernatants 
was removed and the cells were washed with two portions of serum-free eRDF medium After removing the washing 
medium, 500 ml of serum-free eRDF medium was added and incubation was carried out at 37°C for 3 to 6 days After 
incubation, the culture supernatants was recovered, 500 ml of fresh serum-free eRDF medium was added, and incuba- 
tion was again conducted This procedure was repeated several times. The culture supernatants thus recovered were 
combined and concentrated about 20-fold using a YM30 ultrafiltration membrane (Amicon) 

This concentrate was submitted to a hepann-Sepharose column (equilibrated with PBS) and the non-adsorbed 
fraction was recovered This fraction was submitted to a ConA-Sepharose column (equilibrated with PBS) and sepa- 
rated into the non-adsorbed fraction and an adsorbed fraction eluted with a PBS solution containing 200 mM a-methyl- 
n.r^pnoside Th ? Con A non-adsorbed fraction was concentrated usmo V M30 followed bv buffer substitution to m 

, , . ■; : ■ n'r* ? a ■ ■ ■ : >i ■■ * L '"'1 ' , " ' " ' ■ ' - A * ■■ 
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: — ^ • protease nmoitor activity was tnus ■ecoverea 

"he fraction was dialysed against 20 mM Tns-hydrochlonde buffer fpH 8) containing 0 05% CHAPS and then sub- 
jected to HPLC using DEAE (equilibrated with 20 mM Tns-hydrochlonde buffer (pH 8) containing 0 05% CHAPS) foi- 
'owed bv Unear concentration gradient elution with 0 VI to 500 NaO whereby a fraction showing + he deseed 
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The fraction was submitted to GS-520 (equilibrated with PBS containing 0 05% CHAPS) and an active fraction (fraction 
of about 40 to 20 kDa) was recovered. For eliminating minor bands, the fraction was applied to a YMC pack C4 column 
(obtained from YMC), linear concentration gradient elution was carried out over 40 minutes using acetonitnle-isopropyl 
alcohol (3/7) containing 0 1% TFA and varying the concentration thereof from 10% to 50%, and the active fraction was 
neutralized with 1 M Tris-hydrochloride buffer (pH 8) and then dried under reduced pressure After drying, the solid 
obtained was dissolved in PBS containing 0.05% CHAPS to give a purified protein solution 

EXAMPLE 2 

(Ammo-terminal amino acid sequence and partial amino acid sequence determination of the protein) 

The protein having protease inhibitor activity as purified as in Example 1 and eluted by reversed phase HPLC was 
dried under reduced pressure without neutralization. This was dissolved in 60 ^l of 50% TFA (trrfluoroacetic acid), 
added to a polybrene-treated glass filter and subjected to Edman degradation on an Applied Biosystems model 470A 
sequencer, and the ammo acid sequence of an N-terminal region was determined. Phenylhydantoin (PTH)-ammo acids 
were identified using a Mitsubishi Chemical's MCI gel ODS IHU column (0.46 * 15 cm) and conducting single solvent 
elution with acetate buffer (10 mM acetate buffer (pH 4 7), 0.01% SDS, 38% acetonitnle) at aflow rate of i 2 ml/minute 
and a temperature of 43°C. PTH-amino acids were detected based on the absorbance at 269 nm 

As a result, the N-terminal amino acid sequence shown below in Table 1 was identified. 

Then, the same protein having protease inhibitor activity as purified as in Example 1 and eluted by reversed phase 
HPLC was dissolved in 100 ul of 50 mM Tris-hydrochloride buffer (pH 9.0) containing 4 M urea, lysyl endopeptidase 
(Achromobacter protease I) was added to the solution, and the reaction was carried out at 37°C for 8 hours. The result- 
ing peptide mixture was separated by reversed phase HPLC using a YMC pack C8 column (YMC) to give respective 
peptide fragments. Two peptides were subjected to amino acid analysis using a gaseous phase sequencer (Applied 
Biosystems model 1470A). The sequences shown in Table 1 were found. 

TABLE 1 

Ammo acid sequences of peptides 

N-terminal: Ala-Asp-Arg-Glu-Arg-Ser-lle-His-Asp-Phe-Xaa-Leu-Val-Ser-Lys (SEQ ID NO 1 in 
the sequence listing) 

Partial amino acid sequences 

1 : Lys-Val-Val-Gly-Arg-Xaa-Arg-Ala-Ser-Met-Pro-Arg-Trp-Trp-Tyr-Asn-Val- Thr-Asp-Gly-Ser- 
Xaa-Gln-Leu-Phe-Val-Tyr-Gly-Gly (SEQ ID NO;2 in the sequence listing) 
2: Ala-Thr-Val-Thr-Glu-Asn-Ala-Thr-Gly-Asp-Leu-Ala-Thr-Ser-Arg-Asn-Ala- Ala-Asp-Ser-Ser- 
Val-Pro-Ser-Ala-Pro (SEQ ID NO: 3 in the sequence listing) 

(Xaa: ammo acid residue not yet identified) 



EXAMPLE 3 

(Purification of the protein using an A549 cell culture supernatant and amino acid sequence analysis) 

A culture supernatant was prepared by cultivating A549 cells (obtained from the Japanese Cancer Research 
Resources Bank) in the same manner as in Example 1 Using the culture supernatant and proceeding in the same man- 
ner as in Example 1 , a protein having the inhibitory activity on the protease activity of HGF activator was obtained. Upon 
SDS-PAGE. this protein showed the same molecular weight as that derived from MKN45 cells. When subjected to the 
same N-terminal amino acid sequence determination as in Example 1, this protein gave the same sequence as that of 
the MKN45 cell-derived protein This suggested the possibility of the protein being identical with the MKN45-denved 
protein. 
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EXAMPLE 4 

(Method of assaying the activity of the protein inhibiting the protease activity of HGF activator as well as the activity) 

One to ten ul of the sample to be assayed was added to 30 to 40 ul of P6S-0.05% CHAPS solution containing 2 to 
5 ng of serum-derived HGF activator. After 30 minutes of incubation at 37°C, 5 to 10 ug of single chain HGF was added 
and incubation was further continued for 2 hours. This incubation mixture was subjected to SDS-polyacrylamide gel 
electrophoresis under reducing conditions After electrophoresis, Coomassie Brilliant Blue R250 (CBB) staining was 
performed and the proportions of single chain HGF and double chain HGF were compared for activity detection. 

The purified protein (10 ng) and 5 ng of serum-derived HGF activator were incubated in 30 to 40 ul of PBS-0 05% 
CHAPS solution at 37°C for 30 minutes, then 10 ug of single chain HGF was added, and incubation was further contin- 
ued for 2 hours. The incubation mixture was subjected to SDS-polyacrylamide gel electrophoresis under reducing con- 
ditions followed by staining with CBB. The results are shown tn Fig. 1 In the figure, the numeral 1 indicates the case 
where neither of HGF activator and the protein was added, 2 indicates the case where HGF activator was added but 
the protein was not added, and 3 indicates the case where HGF activator and the protein were added. Addition of the 
protein resulted in suppression of the activity of HGF activator converting single chain HGF to double chain HGF 

EXAMPLE 5 

(SDS-polyacrylamide gel electrophoresis) 

For determining the apparent molecular weight of the protein having protease inhibitor activity as purified from the 
MKN45 cell culture supernatant or A549 cell culture supernatant in Example 1 or Example 2, the protein was subjected 
to SDS-polyacrylamide gel electrophoresis. The protein finally purified was subjected to SDS-polyacrylamide gel elec- 
trophoresis using 12.5% polyacrylamide slab gels, which was conducted under nonreducmg conditions. The molecular 
weight markers used were Molecular weight markers "Daiichi" III for Laemmli method (Daiichi Pure Chemicals). After 
electrophoresis, color development was performed using a silver stain reagent (Kanto Chemical). Upon relative com- 
parison in migration distance between the protein and the molecular weight markers, the protein obtained from the 
MKN45 cell culture supernatant or A549 cell culture supernatant showed several fragments or a smear band, presum- 
ably due to differences in sugar chain, amino acid residue modification or terminal region, at positions around an appar- 
ent molecular weight of about 30,000 daltons as determined by SDS-polyacrylamide gel electrophoresis. 

EXAMPLE 6 

(Cloning of a gene coding for the protein and base sequence determination) 

The following two oligonucleotide primers were designed based on a presumption that Xaa will be Cys from the 
standpoint of the sequences Lys-Val-Val-Gly-Arg-Xaa-Arg and Xaa-Gln-Leu-Phe-Val-Tyr-Gly-Gly selected from the par- 
tial ammo acid sequence (SEQ ID N0 2 in the sequence listing) of the protein obtained tn Example 2 

Primer 1 5 -AAGGTNGTNGGNMGNTGYMG-3' (SEQ ID NO 5 in the sequence listing), and 
Primer 2 5 -CNCCGTANACGAANARYTGRC-3' (SEQ ID NO: 6 in the sequence listing). 

(In the above sequences. N indicates T or G, M indicates A or C, Y indicates T or C, and R indicates A or G.) 

Then, total RNA was prepared from MKN45 cells by the method described in Anal Biochem , 162, 156 (1987) and 
applied to on oligo(dT)-cellulose column, whereby poly(A)+RNA was obtained. 

Using the thus-obtained poly(A)+RNA as the template, RT-PCR (reverse transcnption-polymerse chain reaction, cf 
K Hayashi (ed ): 'PCR Ho no Saishin Gijutu (State-of-art Techniques of PCR)", page 44 and page 52, published Feb 
r « /-hH^q-' A qc -^mprj thp r Pac tion mtytijrp obtained bv this RT-PCR was analyzed bv roolyacrylamide 



t:acjmenT was ;ecoveteo ne oase sequence o? :re *:aqmen *db jet*-' '"e ,:ioeo* » ■ 

:ner this DNA fragment was labeled with - P by the method described in Motecuiar ocnmy ; ^oio oprmg Maroor -ac 
oratory, 1982) and used as a screening probe 

Using the MKN45 cell line-derived poly(A)+RNA together with a cDNA synthesis kit (Pharmacia). cDNA was syn- 
•-esized and a phage library was constructed as a library <<y screening with / ZAPU fStrategene) as the vector 
:. sc-ei -c^ia coi! < _ Biue ■ Strategene. 1 was °*ectec wtr *~e arx . ~ chage car y *c z -'e aoc^: ' 00 0CC aaues 

-tte r oversight culture p \Z V medium tne oactenai ceils a ere transferee to a 3ere Sceenng -Cs memD»a'--r 
du pont) T he membrane was placed on a filter paper impregnated with 0 ■ M sodium nydroxide-0 5 M Tms hydrocnio- 
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ride buffer (pH 7 5) and allowed to stand for 2 minutes and then placed on a filter paper impregnated with 1 5 M sodium 
chlonde-0 5 M Tris hydrochloride buffer (pH 7 5) and allowed to stand for 5 minutes. After two more repetitions of this 
series of treatments, the membrane was washed with 2 x SSC (two-fold concentrated SSC) and air-dried on a dry filter 
paper This membrane was irradiated with UV light at a dose of 120 mJ/cm 2 for fixation of the DNAs transferred to the 
membrane. The thus-treated membrane was immersed in 50 mi of a solution* comprising 50 mM Tris hydrochloride 
buffer (pH 7 5), 1 M sodium chloride and 1% SDS and maintained in that state at 65°C for 2 hours. Then, the membrane 
was immersed in 40 ml of a solution comprising 5 ng/ml of the above-mentioned 32 P-labeled probe, 100 ug/ml of 
salmon sperm, 50 mM Tris hydrochloride buffer (pH 7 5), 1 M sodium chloride and 1% SDS and maintained in that state 
at 65°C for 16 hours. Thereafter, this membrane was washed with 2 x SSC at room temperature over 5 minutes and 
then with two portions of 0 1 x SSC at room temperature over 30 minutes, and subjected to autoradiography, which gave 
22 positive clones supposedly containing cDNA for the protein. Then, each positive clone was submitted to plasmid 
construction by the excision method (Toyobo Life Science Catalog, pages 114-115) according to Toyobo Ufe Science 
Catalog (pages 114-115) and Strategene's manual 

The thus obtained plasmid DNAs were then cleaved with the restriction enzyme Eco RI and subjected to agarose 
gel electrophoresis, and a done with the longest cDNA for the protein inserted therein was selected. Then, by analyzing 
the base sequence of the plasmid (pHAI-ll) harbored by this clone, the whole base sequence of the gene coding for the 
protein was determined (SEQ ID NO 4 in the sequence listing). 

EXAMPLE 7 

(Preparation of an expression plasmid for the protein) 

A 10-ug portion of the plasmid (pHAI-ll) containing the cDNA for the protein as obtained in Example 6 was sub- 
jected to cleavage with the restriction enzyme EcoRI, followed by agarose gel electrophoresis. Thus, an about 14 kb 
EcoRI-EcoRI DNA fragment containing the cDNA for the protein was separated and extracted. The thus-obtained DNA 
fragment wag rendered blunt-ended by the conventional method using T4 DNA polymerase and, then, the DNA frag- 
ment was purified by phenol-chloroform extraction and ethanol precipitation and dissolved in 10 ul of water. Separately, 
0 05 ug of the expression vector pME18S (Medical Immunology, 2Q, 27 (1990)) was cleaved in advance with the restric- 
tion enzyme Xhol, and the DNA fragment obtained was rendered blunt-ended by the conventional method using T4 
DNA polymerase and then purified by phenol -chloroform extraction and ethanol precipitation. This was dissolved in 400 
u I of a 1 mM MgCI 2 solution in 50 mM Tris-HCI (pH 8), 1 unit of bacterial alkaline phosphatase (Toyobo, BAP-101) was 
added, and dephosphorylation treatment wag conducted at 65°C for 30 minutes. Then, the DNA fragment was purified 
from this reaction mixture by phenol-chloroform extraction and ethanol precipitation and dissolved in 10 pi of water. 
Ligation reaction was carried out in 20 ul of a reaction mixture (66 mM Tris-HCI, pH 7.6, 6.6 mM MgCI 2 , 10 mM dithio- 
threitol, 66 uM ATP) containing 0 01 ug of the pME18S vector-derived DNA fragment prepared as mentioned above and 
0.1 |ag of the above-mentioned blunt-ended EcoRI fragment of cDNA for the protein in the presence of T4 DNA ligase 
(Toyobo LGA-101) at 14°C for 12 hours. A 10-ul portion of this T4 DNA ligase reaction mixture was used to transform 
Escherichia coli HB101 (Takara Shuzo) according to the manual attached thereto. The microorganism was cultured on 
a medium containing 50 ug/ml of ampicillin and scores of ampicillin-resistant strains were obtained. These transform- 
ants were analyzed by the method of Maniatis et al. ("Molecular Cloning", Cold Spring Harbor Laboratory, pages 86-96 
(1982)) and. as a result, a plasmid, pME18S-HAI-ll, containing the gene coding for the protein as inserted at the Xho l 
restriction enzyme cleavage site occurring between the promoter and polyadenylation site of the expression vector 
pMEl8S could be obtained. Its structure is shown in Fig. 2. 

EXAMPLE 8 

(Obtaining of a cell line expressing the protein) 

The plasmid pMEl8S-HAI-ll constructed in Example 7 and containing the cDNA for the protein as inserted at the 
Xho l restriction enzyme cleavage site of the expression vector pME18S was recovered and purrfied from the recom- 
binant Escherichia coli by the method of Maniatis et al. ("Molecular Cloning", Cold Spring Harbor Laboratory, pages 86- 
96 (1982)) and thus a large amount of the expression plasmid DNA for the protein was obtained. COS cells were trans- 
formed by transfection thereof with the expression plasmid DNA. Thus, COS cells were first cultured in eRDF medium 
containing 10% FBS (total bovine serum) in tissue culture dishes 9 cm in diameter until a semiconfluent condition 
Then, the medium was removed from the dishes, and a DNA solution prepared as mentioned below was added drop- 
wise thereto as mentioned below. First, for each dish 9 cm in diameter, a solution was prepared in an Eppendorf cen- 
trifuge tube by adding thereto 300 ul of 2 x HEBS solution (2 x HEBS solution: 1 6% sodium chloride, 0.074% potassium 
chloride. 0.05% disodium hydrogen phosphate dodecahydrate, 0.2% dextrose, 1% HEPES (pH 7 05)) and 10 ug of the 
plasmid DNA and making the volume 570 ul with sterilized water. Then, while adding 30 ul of 2 5 M calcium chloride 
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solution to the DNA solution, the tube contents were stirred vigorously using a vortex mixer for several seconds The 
resulting mixture was allowed to stand at room temperature for 30 minutes, with occasional stirring at intervals of about 
1 0 minutes using a vortex mixer. The thus-prepared DNA solution was laid on the cells mentioned above and the whole 
was allowed to stand at room temperature for 30 minutes Then, 9 ml of eRDF medium (Kyokuto Pharmaceutical) sup- 
plemented with 10% FBS was added to each dish and incubation was performed at 37°C for 4 to 5 hours m the pres- 
ence of 5% C0 2 Then, the medium was removed from each dish and the cells were washed with 5 ml of 1 x TBS++ 
solution (1 x TBS++ solution: 25 mM Tris-hydrochloride (pH 7 5), 140 mM sodium chloride, 5 mM potassium chloride, 

0 6 mM disodium hydrogen phosphate, 0 08 mM calcium chloride, 0.08 mM magnesium chloride) After removing the 

1 x TBS++ solution, the cells were covered with 5 ml/dish of 1 x HEBS solution containing 10% DMSO (dimethyl sulfox- 
ide) and allowed to stand at room temperature for 1 to 2 minutes The supernatant was then removed The cells were 
again washed with 5 ml of 1 x TBS++ solution, 10 ml of eRDF medium supplemented with 10% FBS was added to each 
dish and incubation was performed at 37°C in the presence of 5% CO? After the lapse of 48 hours, the medium was 
recovered. The supernatant recovered was 20-fold concentrated and assayed for inhibitory activity against HGF activa- 
tor in the same manner as in Example 4 The inhibitory activity was confirmed. 

While the invention has been described in detail and with reference to specrfic embodiments thereof, it will be 
apparent to one skilled in the art that various changes and modifications can be made therein without departing from 
the spirit and scope thereof. 
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SEQUENCE LISTING 
INFORMATION FOR SEQ ID NO: 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 15 amino acids 

(B) TYPEt amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: N- terminal fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MXN45 

(x) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

Ala Asp Arg Glu Arg Ser lie His Asp Phe Xaa Leu Val Ser Lys 
15 10 15 

xaa: unknown amino acid 
INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 

(X) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
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Lys Val Val Gly Arg xaa Arg Ala Ser Met Pro Arg Trp Trp Tyr Asn 

5 10 15 

Val Thr Asp Gly Ser Xaa Gin Leu Phe Val Tyr Gly Gly 
20 25 
xaa t unknown amino acid 
INFORMATION FOR SEQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) lengths 26 amino acids 

/ B N TVDf • »*w4«*% A 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPEs peptide 

(v) FRAGMENT TYPE: internal fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 

(X) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

Ala Thr Val Thr Glu Asn Ala Thr Gly Asp Leu Ala Thr Ser Arg Asn 

1 5 10 15 

Ala Ala Asp Ser Ser Val Pro Ser Ala Pro 

20 25 
INFORMATION FOR SEQ ID NO: 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS i double 
TOPOLOGY r Mr. ear 
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(ii) MOLECULE TYPE: cDNA to mRNA 

(iv) ANTISENSE: no 

(v) ORIGINAL SOURCE t 

(A) ORGANISM: Homo sapiens 

(B) STRAIN j MXN45 

( ix ) FEATURES s 

(A) NAME /KEY j coding sequence 

(B) LOCATION: 1 to 759 

(C) IDENTIFICATION METHOD: by experiment 

(A) NAME/KEY: signal peptide 

(B) LOCATION: 1 to 81 

(C) IDENTIFICATION METHOD : by experiment 

(A) NAME/KEY: mature peptide 

(B) LOCATION: 82 to 759 

(C) IDENTIFICATION METHOD: by experiment 

(x) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

ATG GCG CAG CTG TGC GGG CTG AGG CGG AGC CGG GCG TTT CTC GCC CTG 4 8 

Ket Ala Gin Leu Cy« Gly Leu Arg Arg Ser Arg Al» Phe Leu Ale Ltu 

IS 10 13 

CTG GGA TCG CTG CTC CTC TCT GGG GTC CTG GCG GCC GAC CGA GAA CGC 96 
Leu Gly Ser Leu Leu Leu Ser Gly v«l Leu Alt Ale Aep Arg Glu Arg 

20 25 30 

AGC ATC CAC GAC TTC TGC CTG GTG TCG AAG GTG GTG GGC AGA TGC CGG 144 
Ser lie Hie A«p Phe Cyi Leu Vel Ser Lys v«l Vil Gly Arg Cyi Arg 
35 40 45 
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GCC TCC ATG CCT AGG i TGG TGG TAC AAT GTC ACT GAC GGA TCC TGC CAG 192 
Alt Ser Met Pro Arg Trp Trp Tyr Atn Vtl Thr Atp Gly Ser Cyt Gin 

50 35 60 

CTC TTT CTG TAT CGG GGC TGT GAC GGA AAC AGC AAT AAT TAC CTG ACC 240 
Leu Phe Vtl Tyr Cly Gly Cyt Atp Gly Am Ser Atn Atn Tyr Leu Thr 
65 70 73 80 

AAG GAG GAG TGC CTC AAG AAA TGT GCC ACT CTC ACA GAG AAT GCC ACG 288 
Lyt Glu Glu Cyt Leu Lyt Lyi Cyt All Thr Vtl Thr Olu Am Alt Thr 

85 90 95 

GGT GAC CTG GCC ACC AGC ACG AAT GCA GCG GAT TCC TCT GTC CCA AGT 3 36 

Gly Atp Leu Alt Thr Ser Arg Am Alt Alt Asp Ser Ser Vtl Pro Scr 

100 105 110 

GCT CCC AGA AGG CAG GAT TCT GAA GAC CAC TCC AGC GAT ATG TTC AAC 384 
Alt Pro Arg Arg Gin Atp Ser Glu Atp Hit Ser Str Atp Met Phe Atn 

115 120 125 

TAT GAA GAA TAC TGC ACC GCC AAC GCA GTC ACT GCG CCT TGC CGT GCA 4 32 
Tyr Glu Glu Tyr Cyt Thr Alt Ain Alt Vtl Thr Gly Pro Cyt Arg Alt 

130 135 140 

TCC TTC CCA CCC TGG TAC TTT GAC GTG GAG AGG AAC TCC TGC AAT AAC 4 80 

Ser Phe Pro Arg Trp Tyr Phe Asp Vtl Glu Arg Atn Ser Cyt Asn Asn 
145 150 155 160 

TTC ATC TAT GGA GGC TGC CGG GGC AAT AAG AAC AGC TAC CGC TCT GAG 528 
Phe He Tyr Gly Gly Cyt Arg Cly Asn Lyt Atn Ser Tyr Arg Ser Glu 
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CAC CCC TCC ATC CTC CCC TCC TTC CCC CAC CAC CAC AAT CCT CCC CTC 5 76 

Olu Ala Cyt Met Leu Aig C/8 ?he Arg Gin Gin Glu Asn Pre Pro Leu 

i80 185 190 

CCC CTT CCC TCA AAG GTC CTC CTT CTC CCC CCC CTG TTC CTC ATC CTC 624 
Pro Leu Cly Scr Lys Val Val Val Leu Ala Gly Leu ?he Val Met Val 

1*3 200 205 

TTC ATC CTC TTC CTC CCA CCC TCC ATC GTC TAC CTC ATC CCC GTC CCA 672 
Leu lie Leu The Leu Gly Ala Ser Met Val Tjr Leu lie Arg Val Ala 

210 213 220 

CCC ACG AAC CAC CAC- CCT CCC CTC CCC ACC CTC TCC ACC TCC CCA CAT 720 
Arg Arg Aan Gin Glu Ar& Ala Leu Arg Thr Val Tr? Ser Sec Gly Asp 
225 230 235 240 

CAC AAC CAC CAG CTC CTC AAC AAC ACA TAT CTC CTC TCA 760 
Atp Lya Glu Gin Leu Val Lys A»n Thr Tyr Val Leu * 

245 250 
INFORMATION FOR SEQ ID NO: 5 



(i) SEQUENCE CHARACTERISTICS; 

35 (A) length 2 20 nucleotides 

( B ) TYPE i nucleic acid 

(C) STRANDEDNESS; single stranded 
40 (D ) TOPOLOGY: linear 

(ii) MOLECULE TYPEi other nucleic acid, synthetic DNA 
(x) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

45 



AAGGTNGTNG GNMGNTGYKG 

Nt T or G, M: A or C, Y • T or C, And Pt A or G 
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INFORMATION FOR SEQ ID NCi 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs tingle stranded 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: other nucleic acid, synthetic DNA 
(X) SEQUENCE DESCRIPTION! SEQ ID NO: 6 

CNCCGTANAC GAANARYTGR C 
20 N: T or G, Mt A or C, Y: T or C, and Rt A or G 

25 

3C 
35 



45 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: MITSUBISHI CHEMICAL CORPORATION 

(B) STREET: 5-2, Marunouchi 2-chome, Chiyoda-ku 

(C) CITY: Tokyo 

(E) COUNTRY: Japan 

(F) POSTAL CODE (ZIP): none 



(ii) TITLE OF INVENTION: NOVEL PROTEIN, DNA CODING FOR SAME 
AND METHOD OF PRODUCING THE PROTEIN 



(iii) NUMBER OF SEQUENCES: 7 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 96 111 861.9 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: N- terminal 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Ala Asp Arg Glu Arg Ser He His Asp Phe Xaa Leu Val Ser Lys 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Lys Val Val Gly Arg Xaa Arg Ala Ser Met Pro Arg Trp Trp Tyr Asn 
15 10 15 

Val Thr Asp Gly Ser Xaa Gin Leu Phe Val Tyr Gly Gly 
20 25 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 



3C 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Thr Val Thr Glu Asn Ala Thr Gly Asp Leu Ala Thr Ser Arg Asn 
15 10 15 

Ala Ala Asp Ser Ser Val Pro Ser Ala Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : cDNA to mRNA 
(iv) ANTI-SENSE: NO 



L?0 758 682 A2 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
( B } LOCATION: 1 756 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence^ EXPERIMENTAL 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION : 1 . .81 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION : / evidence= EXPERIMENTAL 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 82. .756 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION :/evidence= EXPERIMENTAL 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATG GCG CAG CTG TGC GGG CTG AGG CGG AGC CGG GCG TTT CTC GCC CTG 48 

Met Ala Gin Leu Cys Gly Leu Arg Arg Ser Arg Ala Phe Leu Ala Leu 

-27 -25 -20 -15 

CTG GGA TCG CTG CTC CTC TCT GGG GTC CTG GCG GCC GAC CGA GAA CGC 96 

Leu Gly Ser Leu Leu Leu Ser Gly Val Leu Ala Ala Asp Arg Glu Arg 
-10 -5 1 5 

AGC ATC CAC GAC TTC TGC CTG GTG TCG AAG GTG GTG GGC AGA TGC CGG 144 
Ser He His Asp Phe Cys Leu Val Ser Lys Val Val Gly Arg Cys Arg 
10 15 20 

GCC TCC ATG CCT AGG TGG TGG TAC AAT GTC ACT GAC GGA TCC TGC CAG 192 
Ala Ser Met Pro Arg Trp Trp Tyr Asn Val Thr Asp Gly Ser Cys Gin 
25 30 35 

CTG TTT GTG TAT GGG GGC TGT GAC GGA AAC AGC AAT AAT TAC CTG ACC 2 40 

Leu Phe Val Tyr Gly Gly Cys Asp Gly Asn Ser Asn Asn Tyr Leu Thr 
40 45 50 

AAG GAG GAG TGC CTC AAG AAA TGT GCC ACT GTC AC A GAG AAT GCC ACG 2 88 

Lys Glu Glu Cys Leu Lys Lys Cys Ala Thr Val Thr Glu Asn Ala Thr 
55 60 65 

GGT GAC CTG GCC ACC AGC AGG AAT GCA GCG GAT TCC TCT GTC CCA AGT 3 36 

ao Gly Asp Leu Ala Thr Ser Arg Asn Ala Ala Asp Ser Ser Val Pro Ser 

70 75 80 85 

GCT CCC AGA AGG CAG GAT TCT GAA GAC CAC TCC AGC GAT ATG TTC AAC 384 

Ala Pro Arg Arg Gin Asp Ser Glu Asp His Ser Ser Asp Met Phe Asn 

90 95 100 

45 

TAT GAA GAA TAC TGC ACC GCC AAC GCA GTC ACT GGG CCT TGC CGT GCA 4 32 

Tyr Glu Glu Tyr Cys Thr Ala Asn Ala Val Thr Gly Pro Cys Arg Ala 

105 110 115 



35 
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20 



TCC TTC CCA CGC TGG TAC TTT GAC GTG GAG AGG AAC TCC TGC AAT AAC 4 80 

Ser Phe Pro Arg Trp Tyr Phe Asp Val Glu Arg Asn Ser Cys Asn Asn 
120 125 130 

TTC ATC TAT GGA GGC TGC CGG GGC AAT AAG AAC AGC TAC CGC TCT GAG 52 8 

Phe lie Tyr Gly Gly Cys Arg Gly Asn Lys Asn Ser Tyr Arg Ser Glu 
135 140 145 

GAG GCC TGC ATG CTC CGC TGC TTC CGC CAG CAG GAG AAT CCT CCC CTG 57 6 

Glu Ala Cys Met Leu Arg Cys Phe Arg Gin Gin Glu Asn Pro Pro Leu 
150 155 160 165 

CCC CTT GGC TCA AAG GTG GTG GTT CTG GCG GGG CTG TTC GTG ATG GTG 624 
Pro Leu Gly Ser Lys Val Val Val Leu Ala Gly Leu Phe Val Met Val 
170 175 180 

TTG ATC CTC TTC CTG GGA GCC TCC ATG GTC TAC CTG ATC CGG GTG GCA 672 
Leu lie Leu Phe Leu Gly Ala Ser Met Val Tyr Leu lie Arg Val Ala 
185 190 195 

CGG AGG AAC CAG GAG CGT GCC CTG CGC ACC GTC TGG AGC TCC GGA GAT 7 20 

Arg Arg Asn Gin Glu Arg Ala Leu Arg Thr Val Trp Ser Ser Gly Asp 
200 205 210 

GAC AAG GAG CAG CTG GTG AAG AAC ACA TAT GTC CTG TGA 7 59 

Asp Lys Glu Gin Leu Val Lys Asn Thr Tyr Val Leu 
215 220 225 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 252 amino acids 
3C (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

35 Met Ala Gin Leu Cys Gly Leu Arg Arg Ser Arg Ala Phe Leu Ala Leu 
-27 -25 -20 -15 

Leu Gly Ser Leu Leu Leu Ser Gly Val Leu Ala Ala Asp Arg Glu Arg 

-10 -5 1 5 

4C Ser He His Asp Phe Cys Leu Val Ser Lys Val Val Gly Arg Cys Arg 

10 15 20 

Ala Ser Met Pro Arg Trp Trp Tyr Asn Val Thr Asp Gly Ser Cys Gin 
25 30 35 

45 Leu Phe Val Tyr Gly Gly Cys Asp Gly Asn Ser Asn Asn Tyr Leu Thr 
40 45 50 

Lys Glu Glu Cys Leu Lys Lys Cys Ala Thr Val Thr Glu Asn Ala Thr 
55 60 65 



25 
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Gly Asp Leu Ala Thr Ser Arg Asn Ala Ala Asp Ser Ser Val Pro Ser 
70 75 80 85 

Ala Pro Arg Arg Gin Asp Ser Glu Asp His Ser Ser Asp Met Phe Asn 
90 95 100 

Tyr Glu Glu Tyr Cys Thr Ala Asn Ala Val Thr Gly Pro Cys Arg Ala 
105 110 115 

Ser Phe Pro Arg Trp Tyr Phe Asp Val Glu Arg Asn Ser Cys Asn Asn 
120 125 130 

Phe He Tyr Gly Gly Cys Arg Gly Asn Lys Asn Ser Tyr Arg Ser Glu 
135 140 145 

Glu Ala Cys Met Leu Arg Cys Phe Arg Gin Gin Glu Asn Pro Pro Leu 
150 155 160 165 

Pro Leu Gly Ser Lys Val Val Val Leu Ala Gly Leu Phe Val Met Val 
170 175 180 

Leu He Leu Phe Leu Gly Ala Ser Met Val Tyr Leu He Arg Val Ala 
185 190 195 

Arg Arg Asn Gin Glu Arg Ala Leu Arg Thr Val Trp Ser Ser Gly Asp 
200 205 210 

Asp Lys Glu Gin Leu Val Lys Asn Thr Tyr Val Leu 
215 220 225 

(2) INFORMATION FOR SEQ ID NO: 6: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA M 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION :group( 6, 9, 12, 15) 

(D) OTHER INFORMATION: /note= "N is T or G " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AAGGTNGTNG GNMGNTGYMG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION : group ( 2 , 8, 14) 

(D) OTHER INFORMATION: /note= "N is T or G" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CNCCGTANAC GAANARYTGR C 21 



uiaims 



20 1 . A protein having the following physico-chemical properties: 

(1) a molecular weight of about 30,000 daltons as determined by SDS-polyacrylamide gel electrophoresis; 

(2) inhibitory activity on the protease activity of hepatocyte growth factor activator; and 

(3) one of the amino acid sequences depicted in the sequence listing under SEQ ID NO:1 through 3 or an 
25 ammo acid sequence substantially equivalent thereto. 

2. A protein having one of the amino acid sequences depicted in the sequence listing under SEQ ID NO:1 through 3 
or an amino acid sequence substantially equivalent thereto and having inhibitory activity on the protease activity of 
hepatocyte growth factor activator. 

3C 

3. A protein having the amino acid sequence depicted in the sequence listing under SEQ ID NO:4 or an ammo acid 
sequence substantially equivalent thereto. 

4. A protein having, as its amino acid sequence, that segment of the ammo acid sequence depicted in the sequence 
35 listing under SEQ ID NO:4 which starts with the 28th amino acid (alanine) residue and ends with the 252nd ammo 

acid (leucine) residue, or an amino acid sequence substantially equivalent thereto. 

5. A DNA coding for the protein of Claim 1 . 
*c 6. A DNA coding for the protein of Claim 2 

7. A DNA coding for the protein of Ciaim 3 

8. A DNA as claimed in Claim 7 which is represented by the base sequence depicted in the sequence listing under 
45 SEQ ID NO:4 or a base sequence substantially equivalent thereto. 

9. A DNA coding for the protein of Claim 4 

1 0. A DNA as claimed in Claim 9 which is represented by that segment of the base sequence depicted m the sequence 



1 2. A gene coding for the protein of Claim 2 



14. - je r ^e as : a r ^ec " - a ^ * ; cr s ■ec^ser'teo ~y oase ^ec.e'xe lecctec ~ s* ":~ 
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SEQ ID NO 4 or a base sequence substantially equivalent thereto 

1 5. A gene coding for the protein of Claim 4. 

16. A gene as claimed in Claim 15 which is represented by that segment of the base sequence depicted in the 
sequence listing under SEQ ID NO:4 which starts with the 82nd nucleotide (guanine) and ends with the 759th 
nucleotide (adenine), or a base sequence substantially equivalent thereto. 

17. An expression vector which comprises the DNA or gene of one of Claims 5 through 16. 

18. A transformant as obtained by transformation of a host cell with the expression vector of Claim 1 7. 

19. A transformant as claimed in Claim 18, wherein the host ceil is an animal cell 

20. A method of producing a protein having inhibitory activity on the protease activity of hepatocyte growth factor acti- 
vator which comprises cultivating the transformant of Claim 18 or 19 to thereby produce the protein having inhibi- 
tory activity on the protease activity of hepatocyte growth factor activator. 



21. 



A method of protein production as claimed in Claim 20, wherein the protein is a protein defined in one of Claims 1 
through 4. 
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Fig. 2 
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